Code Similarity on High Level Programs

Authors

  • M. Miron Bernal
  • Hugo Cesar Coyote Estrada
  • Jesús Figueroa-Nazuno
Abstract

This paper presents a new approach to code similarity for high-level programs. Our technique is based on Fast Dynamic Time Warping, which builds a warp path (a relation between points) under local restrictions. The source code is represented as Time Series using the operators of the programming language, which makes the comparison possible. This representation also allows the detection of subsequences that represent similar code instructions. In contrast with other code similarity algorithms, we do not perform feature extraction. The experiments show that two source codes are similar when their respective Time Series are similar.
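
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses classic dynamic-programming DTW with an optional band constraint as a stand-in for Fast Dynamic Time Warping, and the operator-to-number table `OPERATOR_CODES`, the helper `to_series`, and the `window` parameter are all hypothetical names chosen for illustration. The abstract does not say how operators are encoded, so the numeric values below are arbitrary.

```python
import re

# Hypothetical operator encoding; the paper's actual mapping is not given in the abstract.
OPERATOR_CODES = {"=": 1, "+": 2, "-": 3, "*": 4, "/": 5, "<": 6, ">": 7, "==": 8}

# Try the two-character operator first so "==" is not read as two "=".
_OPERATOR_RE = re.compile(r"==|[=+\-*/<>]")


def to_series(source):
    """Map source text to a list of numbers, one per operator occurrence."""
    return [OPERATOR_CODES[tok] for tok in _OPERATOR_RE.findall(source)]


def dtw(a, b, window=None):
    """Classic DTW (not FastDTW). Returns (distance, warp_path).

    `window` optionally limits how far the path may stray from the diagonal,
    a simple form of local restriction.
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both series must be non-empty")
    # The band must be at least |n - m| wide or the end cell is unreachable.
    w = max(n, m) if window is None else max(window, abs(n - m))
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(max(1, i - w), min(m, i + w) + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Walk back from the end cell to recover the warp path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return cost[n][m], path


if __name__ == "__main__":
    s1 = to_series("total = price * qty + tax")
    s2 = to_series("t = p * q + x")  # same structure, renamed variables
    print(dtw(s1, s2))
```

Because only operators are encoded, the two statements in the usage example produce the same series regardless of identifier names, so their DTW distance is zero under this toy encoding.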

Similar resources

Soft Computing Approach to Performance Analysis of Parallel and Distributed Programs

This paper describes a novel approach to performance analysis for parallel and distributed systems that is based on soft computing. We introduce the concept of performance score representing the performance of code regions that is based on fuzzy logic. We propose techniques for fuzzy-based performance classification. A novel high-level query language is designed to support the search for perfor...

Effects of Code Obfuscation on Android App Similarity Analysis

Code obfuscation is a technique to transform a program into an equivalent one that is harder to be reverse engineered and understood. On Android, well-known obfuscation techniques are shrinking, optimization, renaming, string encryption, control flow transformation, etc. On the other hand, adversaries may also maliciously use obfuscation techniques to hide pirated or stolen software. If pirated...

Assessment of genetic diversity in Pythium aphanidermatum isolates using ISSR and rep-CR methods

Sixty isolates of Pythium aphanidermatum as the causal agent of sugar beet root rot were selected on the basis of their geographical origins, morphological and genetic diversity studies. Pathogenicity test using the Kruskal-Wallis analysis showed significant differences between the pathogenic potential of isolates. Based on the growth rate, isolates were categorized in two groups with low and h...

Automatic Rootcausing for Program Equivalence Failures in Binaries

Equivalence checking of imperative programs has several applications including compiler validation and cross-version verification. Debugging equivalence failures can be tedious for large examples, especially for low-level binary programs. In this paper, we formalize a simple yet precise notion of verifiable rootcause for equivalence failures that leverages semantic similarity between two progra...

An evaluation of duplicate code detection using anti-unification

This paper describes an algorithm for finding software clones, which works at the level of abstract syntax trees and is thus conceptually independent of the source language of the analyzed programs. We use a notion of clones which captures replacement of subtrees in the program AST, and is formally based on the notion of anti-unification. This allows us to capture syntactic structural similarit...

Journal:
  • CoRR

Volume: abs/0710.5547

Publication date: 2007